Generating Coherent Messages in Real-time Decision Support: 
Exploiting Discourse Theory for Discourse Practice 
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Abstract 

This paper presents a message planner, TraumaGEN, 
that draws on rhetorical structure and discourse theory 
to address the problem of producing integrated messages 
from individual critiques, each of which is designed to 
achieve its own communicative goal. TraumaGEN takes 
into account the purpose of the messages, the situation 
in which the messages will be received, and the social 
role of the system. 

Introduction 

The generation of multisentential discourse has focused 
on generating text that accomplishes one particular 
rhetorical goal, such as describing a physical device. In 
contrast, to deliver real-time decision support in trauma 
management, our text generation system must be able 
to take an arbitrary and often inter-related set of com- 
municative goals and produce a message that realizes 
the entire set in as concise and coherent a manner as 
possible. This paper presents our implemented system, 
TraumaGEN, that addresses this problem. It describes 
the strategies that we have adopted to produce coherent 
integrated messages and discusses how our system takes 
into account factors such as the social role of the system. 

Real-Time Decision-Support System 

TraumAID (Webber, Rymon, and Clarke, 1992) is a de- 
cision support system for addressing the initial definitive 
management of multiple trauma. TraumaTIQ (Gertner 
and Webber, 1996) is a module that compares a physi- 
cian's plan for managing patient care with TraumAID 's 
own management plan and critiques significant differ- 
ences between them. To hypothesize the physician's 
plan, TraumaTIQ first chains to identify possible expla- 
nations for an action; it then evaluates these possible 
explanations on the basis of relevance to TraumAID 's 
current management plan and evidence provided by the 
physician's other actions. Once the best explanation(s) 
have been incorporated into the system's model of the 
physician's plan, TraumaTIQ identifies differences be- 
tween that plan and TraumAID 's current management 
plan and notifies the physician of those discrepancies 
that could seriously impact patient care. These include 
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actions that have been ordered but are not yet justified, 
procedures that are suboptimal, scheduling errors, and 
omitted actions. TraumaTIQ's critiques are conveyed 
using natural language sentences generated by filling in 
sentence schemata. 

The problem we address here is that, while in isola- 
tion each of TraumaTIQ's critiques may effectively warn 
a physician about a problem in their plan, in most cases 
when TraumaTIQ finds the physician's plan deficient, 
several problems are detected and thus multiple critiques 
are produced. We found that there was informational 
overlap among the critiques, that some critiques de- 
tracted from other ones, and that some critiques would 
make more sense if they took explicit account of those 
appearing earlier. Thus a text planner was needed to 
generate coherent and concise integrated messages that 
satisfy the multiple goals of the individual critiques. 

Text Planning for Multiple Goals 

The system we built to solve this problem, Trauma- 
GEN, is presented with several communicative goals and 
a means for achieving each goal in isolation. It uses a set 
of transformational rules to transform these into coher- 
ent message units that achieve the overall set of goals. 
These message units are then translated into natural lan- 
guage using sentence schemata. 

This transformational process differs from previous 
text planning efforts in several ways. First, Trauma- 
GEN's eventual message must achieve several top-level 
communicative goals as a unit. This differs from tradi- 
tional text planners that must satisfy a single rhetorical 
goal such as relating the temporal sequence of events in 
which a particular ship is a part (Hovy, 1991) or achieve 
a single intentional goal such as getting the user to make 
a particular change in a Lisp program (Moore and Paris, 
1993). In such cases, the text planner can construct 
a plan top-down from the single goal and include those 
propositions that fit into a coherent piece of text and con- 
tribute to achieving the top-level goal. Although Wolz 
(1990) developed a system for generating text satisfy- 
ing dual, but related, discourse goals such as respond- 
ing and enriching, her system focused on eliminating ob- 
vious or redundant information, not on producing in- 



Individual critiques produced by TraumaTIQ: 

Caution: check for medication allergies and order pulmonary care immediately to treat the left pulmonary 
parenchymal injury. 

Caution: check for medication allergies and order pulmonary care immediately to treat the compound rib 
fracture of the left chest. 

Caution: check for medication allergies and do a laparotomy immediately to treat the intra-abdominal injury. 
Caution: do a laparotomy and repair the left diaphragm immediately to treat the lacerated left diaphragm. 
Consider checking for medication allergies now to treat a possible GI tract injury. 

Merged message: 

Caution: check for medication allergies as part of treating the left pulmonary parenchymal injury, treating 
the compound rib fracture of the left chest, treating the intra-abdominal injury, and treating a possible 
GI tract injury. Then order pulmonary care to complete treating the left pulmonary parenchymal injury 
and treating the compound rib fracture of the left chest, and do a laparotomy to complete treating the 
intra-abdominal injury. 

Figure 1: Original Critiques and a Merged Message 



tegrated messages from individual, possibly conflicting, 
critiques. Similarly, the WISHFUL system (Zukerman 
and McConachy, 1995) includes an optimization stage 
during which it chooses the optimal way to achieve a set 
of related communicative goals; however, the system can 
choose to eliminate propositions and it does not have to 
deal with conflict within the information to be conveyed. 

Second, the means for achieving each of the individ- 
ual goals has already been identified by other modules. 
Thus TraumaGEN is not responsible for identifying the 
content of the message but must instead determine how 
to realize an effective overall message from the set of in- 
dividual critiques. We note that this problem is likely to 
arise elsewhere as sophisticated systems distribute their 
processing across individual modules, each of which may 
need to communicate with the user. 

Although natural language has been used in other 
health care systems such as MIGRAINE (Buchanan et 
al., 1995) and Piglet (Binsted, Cawsey, and Jones, 1995), 
their applications have not required that they combine 
several independent but inter-related text plans into a 
single integrated message. The work most closely related 
to ours is HealthDoc (Hirst and DiMarco, 1996; Wanner 
and Hovy, 1996); however, HealthDoc (currently under 
development) focuses on editing sentences selected from 
a master text, such as by inserting pronouns or by delet- 
ing references to propositions that do not appear earlier 
in the selected text. 

Constructing Effective Message Units 

The nature of trauma management and our observations 
of communication in the emergency room trauma bay 
suggested several features that should influence the gen- 
eration process: 

• Purpose: Since the purpose of messages is to sup- 
port decision-making, the system's recommendations 



should continue to be organized in terms of rele- 
vant domain goals, so that the physician can evaluate 
the system's recommendations and decide whether to 
adopt them. 

• Situation : Since the emergency room is a chaotic set- 
ting for time-critical decisions, the messages must be 
succinct, unambiguous, and easily assimilated. 

• Social role : Since the system's social role on the med- 
ical team is that of an expert consultant to the physi- 
cian who retains responsibility for the quality of pa- 
tient care, it must recognize that the physician can 
ignore its recommendations. This differs from other 
scenarios, such as tutoring, where the system is the 
sole arbiter of correct behavior. 

Our transformational rules take these factors into ac- 
count. 

Informational Overlap One prevalent problem in 
TraumaTIQ 's output is informational overlap — actions 
often appear in several different warnings and thus the 
message as a whole appears repetitious. For example, 
the upper half of Figure Q presents a set of five individ- 
ual warning messages generated by TraumaTIQ at one 
point in a case. Each warns about the omission of war- 
ranted actions; the fifth one is realized differently due 
to the lesser estimated disutility of the identified error. 
While it seems obvious that this set of comments should 
be combined into a more coherent message, it is much 
less clear which of many ways to effect the combination. 
Our approach for merging critiques is motivated by four 
often conflicting goals: 1) group by relevant treatment 
goals, 2) avoid repeated mention of the same actions, 
since this can erroneously suggest multiple instances of 
the action, 3) produce concise messages, and 4) produce 
few, rather than many, individual messages. 



(Consider User {AO, A2, A3)) 



n| I s 

(Recommend System User (AO, A2, A3|) (Persuaded User |A0, A2, A3|) 
Do AO, A2, and A3 |n 

(Motivation {A0,A2,A3), |G11) 

N 

(Bel User (Achieves | A0,A2,A3) {Gl |)) 
N 

(Infonn System User (Acliieves |A0,A2,A3| {Gl|)) 
toGl 

(Consider User |A1, A2, A3,A4)) 



(Recommend System User { A1,A2,A3,A4|) (Persuaded User | Al, A2, A3,A4)) 

Do Al, A2, A3, and A4 | N 

(Motivation { A1,A2,A3,A4|, {G21) 

I N 

(Bel User (Achieves j A1,A2,A3,A4) {G21)) 
I N 

(Inform System User (Achieves { A1,A2,A3,A4| |G21)) 
to G2 

Figure 2: Two Individual Text Plans 



Rhetorical structure theory (Mann and Thompson, 
1987) posits that a coherent text plan consists of seg- 
ments related to one another by rhetorical relations such 
as motivation or background. To address the problem of 
informational overlap, we found that it was necessary to 
draw on the multi-nuclear Sequence relation of RST. We 
posited that separate plans for similar communicative 
goals involving sets of recommended actions in the orig- 
inal messages (such as the two plans in Figure ||) could 
be reorganized as a sequence of communicative goals in 
a single plan, with the recommended actions distributed 
over the sequentially related goals (as in the plan in Fig- 
ure H), as long as the new plan captures the relation- 
ships between recommended actions and their motiva- 
tions given in the original plans. Reorganizing messages 
into sequentially related goals allowed us to construct 
merged messages that exploit informational overlap in 
the individual messages yet still achieve the goals of the 
original messages. 

Thus our first rule. Combine- Similar- Intentions, looks 
for overlap among the components of individual mes- 
sages that have a similar communicative goal (such as 
a goal of getting the physician to perform some omit- 
ted actions) and evaluates the resultant message using 
a metric that weighs 1) the number of segments that a 
goal is distributed over in the resultant message, 2) the 
reduction in repetition of actions, 3) whether goals must 
be repeated, and 4) the number of individual critiques 
that are merged. 

The first term in our metric measures how much the 
original messages were reorganized. Since the original 



messages represent an organization in terms of treatment 
goals, the more segments comprising a merged message, 
the less the message is organized in terms of how to ad- 
dress a relevant goal and the more it resembles an action 
recipe. The merged message in Figure ^ consists of two 
segments, each realized as a sentence. One consequence 
is that the goal of treating the compound rib fracture 
is now distributed across both segments. We arbitrarily 
limit combined messages to three segments in order to 
maintain a goal-oriented organization, as dictated by the 
purpose of our messages. 

The next three components of our metric measure how 
well a merge achieves concise, unambiguous, and eas- 
ily assimilated messages (as required by the situation in 
which the messages will be received). The reduction in 
repetition of actions contributes both to concise mes- 
sages and decreasing ambiguity; however, achieving this 
often requires the repetition of goals, which detracts from 
the succinctness of the message. We hypothesize that 
a few coherent messages will be more easily assimilated 
than many individual messages and thus our metric takes 
into account the number of individual critiques that are 
merged into the resultant message. 

Figure ^ illustrates the text plans underlying the in- 
dividual critiques: 

Caution: do < >, < A2 >, and < A3 > to 

< Gl >. 

Caution: do < Ai >, < A2 >, < A3 >, and < A4 > 
to < G2 >■ 

while Figure ^ illustrates the text plan produced by 
TraumaGEN for the merged message 

Do < Aq > as part of < Gi > and < Ai > as part 
of < G2 >. Next do < > and < ^43 > to address 
both of these goals. Then do < A4 > to complete 

< G2 >. 

For reasons of efficiency and real-time response, Trauma- 
GEN applies its transformational rules directly to the 
logical form of the critiques produced by TraumaTIQ. 
An example of an actual merged message produced by 
TraumaGEN is shown in Figure |l|. 

Trailing Comments When several critiques are 
merged into a single message, the message may refer 
to actions that are also part of critiques that did not 
participate in the merge. Once these actions have been 
introduced in the merged message, discourse theory, par- 
ticularly work on focusing heuristics (McKeown, 1985), 
suggest that the other critiques referencing these actions 
be included in the merged message as well. However, 
rather than restructure the result of our merge trans- 
formation, we append them to the end of the message. 
Thus we refer to them as trailing comments. 

Unfortunately, trailing comments have the potential to 
erroneously suggest new instances of actions. Our solu- 
tion to this problem is to (1) make the previously focused 



(Consider User (AO, Al, A2, A3, A4)) 




(Recommend System User (AO)) (Persuaded User (AO)) (Recommend System User ( Al |) (Persnaded User I Al )) 



(Bel User (Part-pf | AO) (Gl ))) 

fN 

(Inform System User (Part-of | AO | | G 1 1 )) 
as pari of Gl 



(Bel User (Part-of (Al ) (G2))) 
N 

(Inform System User (Part-of (Al | {G2|)) 
as part of G2. 



(Inform System User (Part-of |A2,A3)(G1,G2))) 
to address both of these goals. 



(Inform Systein User (End (A4) |G2))) 
to complete G2. 



Figure 3: Text Plan for First Merged Message 



action the subject of the sentence, reflecting its given sta- 
tus in the discourse, and (2) utilize cue words to call ad- 
ditional attention to its occurrence earlier in the message 
and to the new information being conveyed. Thus the 
first trailing comment is introduced with the cue word 
moreover since this cue word carries the implication of 
saying more about something already discussed, and the 
cue word also is used to introduce the additional infor- 
mation. In the following critique, for example, the final 
sentence (underlined for exposition) contains a trailing 
comment: 

Check for distended neck veins and decreased breath 
sounds to assess the possibility of a left ten- 
sion pneumothorax and a pericardial tamponade. 
Then check for muffled heart sounds and con- 
tinued shock to complete assessing the possibility 
of a pericardial tamponade. Moreover, checking 
for muffled heart sounds is also indicated to assess the 
possibility of a pericardial injury. 

If there is a second trailing comment, it is then intro- 
duced with the cue phrase in addition since it suggests 
presenting new information of a similar nature, in this 
case another reference to a previously introduced action. 

A trailing comment may need to refer to other actions 
in addition to the one previously focused on. We ac- 
complish this by subordinating those actions in a phrase 
introduced by the cue phrase along with, in a sentence in 
which the previously focused action is the subject. For 
example, in Figure ^ the fourth critique is not included 
in the merged message but includes an action {doing a 
laparotomy) that is part of the merged message. Thus 
the fourth critique is realized as a trailing comment at 
the end of the merged message: 

Moreover, doing the laparotomy is also indicated, 

along with repairing the left diaphragm, to treat the 

lacerated left diaphragm. 

Revising Interacting Critiques In TraumaTIQ's 
original output, we noticed that one critique could de- 
tract from another critique, although each was both 
justified and coherent in isolation. Consider, for ex- 



Performing local visual exploration of all abdominal 
wounds is preferred over doing a peritoneal lavage for 
ruling out a suspicious abdominal wall injury. 

Please remember to check for laparotomy scars before 
you do a peritoneal lavage. 

Figure 4: Two Conflicting Critiques 



ample, the two critiques shown in Figure The first 
cautions the physician that a procedure other than the 
just-ordered peritoneal lavage is the recommended pro- 
cedure in this instance, although the disparity is not 
critical. The second reflects the fact that a peritoneal 
lavage should not be done on someone with abdominal 
scarring. Since TraumAID does not yet have any infor- 
mation about the presence of abdominal scarring in this 
patient, the critique reminds the physician of the need 
to first check for it. 

However, together the two critiques appear incoherent. 
This can be more pronounced when the two critiques are 
separated by other comments, since the second critique 
gives the impression that a peritoneal lavage will be per- 
formed. In some situations, such as tutoring, it might 
be appropriate to discard the second critique. However, 
in real-time decision-support, this may be inappropriate 
since it presumes that the system is the sole arbiter of 
high-quality performance whose advice must be followed. 

Our solution to this problem is to allow revision rules 
that are triggered when two critiques have a potential 
conflict. In such cases, the two critiques are merged into 
a single message, and the conflicting critique is revised. 
For example, our rule Revise- Conflict is triggered when- 
ever a critique whose goal is that an action be properly 
scheduled occurs with a critique whose intention is that 
the action be replaced with one more highly preferred. 
Revise- Conflict merges the two critiques into a single 
message, where the merged message reflects a conces- 
sion that the original action might still be executed and 



(Consider User (LVE)) 




(Recommend System User { LVE } ) (Persuaded User { LVE } ) 
(implicit) I N 

(Motivation {LVE) PrefProc) 
N 

(Bel User (Preferred (LVE) Per-Lav {Abd-W-Inj ))) 

[n 

(Inform System User (Preferred {LVE) Per-Lavage {Abd-W-Inj)) 

Performing local visual exploration of all abdominal 
wounds is preferred over doing a peritoneal lavage 
for ruling out a suspicious abdominal wall injury. 



However 



(Concession System User Per-Lav) 



(Inform System User (Do User Per-Lav)) 
you do a peritoneal lavage^ 



then 

(CoiTect-Exec User Per-Lav) 
I 

(Condition {Lap-Scar} Per-Lav) 
N 

(Inform System User 

(Condition {Lap-Scar} Per-Lav)) 

remember to first check for 
laparotomy scars 



Figure 5: Text Plan for Revised Message 



the scheduling critique is revised so that it is condition- 
ally dependent on the original action being done. For 
example, in the case of the critiques in Figure ^, the two 
critiques would be revised and merged into the single 
message 

Performing local visual exploration of all abdominal 
wounds is preferred over doing a peritoneal lavage for 
ruling out a suspicious abdominal wall injury. How- 
ever, if you do a peritoneal lavage, then remember to 
first check for laparotomy scars. 

Figure |^ illustrates the text plan underlying the revised 
message. Note that the new message still recommends 
the better procedure, but leaves the final choice with 
the physician who is responsible for quality patient care. 
On the other hand, if the second critique in Figure ^ ap- 
peared by itself, no revision of the message would occur. 

Another rule, Revise-Interactions, is triggered when a 
critique whose goal is to postpone a dependent action 
occurs in conjunction with a critique whose goal is exe- 
cution of the action on which the dependency is based. 
For example. Figure]^ presents two of TraumaTIQ's cri- 
tiques. While the two critiques do not conflict, the rela- 
tion between their communicative goals can be exploited 
to produce a more concise and coherent message. Revise- 
Interactions establishes a Sequence relation between do- 
ing the peritoneal lavage and the decision about whether 
to do the reassessment, and produces the message 
Do a peritoneal lavage immediately as part of ruling 
out abdominal bleeding. Use the results of the peri- 
toneal lavage to decide whether or not to reassess the 
patient in 6 to 24 hours. 

Other Influences on Effective Messages 
The Role of Focus Focus (Grosz, 1977; McKeown, 
1985) has been the objective of much discourse research, 
and it plays several roles in our generation of messages. 
As noted earlier, trailing comments capture communica- 
tive goals that relate to previously mentioned actions, 
and cue words are used to shift focus back to the earlier 



Caution: do a peritoneal lavage immediately as part 
of ruling out abdominal bleeding. 

Do not reassess the patient in 6 to 24 hours until after 
doing a peritoneal lavage. The outcome of the latter 
may affect the need to do the former. 

Figure 6: Two Dependent Critiques 



actions, hi addition, if there is more than one trailing 
comment, they appear in order of the most recently in- 
troduced action, thus representing successive pops of a 
focus stack. 

Focus also affects the way in which some communica- 
tive goals are realized in messages. For example, if a 
goal of getting the user to recognize several scheduling 
constraints is the sole content of a message, it would be 
realized with the subordinate clause first to call attention 
to the ordering constraint, as in the following: 

Before getting the urinalysis, insert the left chest tube 
and get the post chest tube x-ray because they have a 
higher priority. 

However, if the physician has omitted some of the ac- 
tions and the scheduling constraint is incorporated into 
the text plan for getting the physician to do the omitted 
actions, then focus considerations dictate that the main 
clause appear first since it continues the actions in fo- 
cus. The following is such an example produced by our 
system: 

Check for medication allergies, give antibiotics, set 
up the auto-transfuser for the left chest tube, in- 
sert a left chest tube, and get a post chest tube x- 
ray to treat the simple left hemothorax. Insert the 
left chest tube and get the post chest tube x-ray before 
doing the peritoneal lavage because they have a higher 
priority. 



Definite Versus Indefinite References Critiques or 
any other message from the system should be phrased 
in terms of what is shared knowledge in the emer- 
gency room. We equate shared knowledge with the cur- 
rent state of the case, as it has been entered into the 
computer-based medical record (CBMR). When a pro- 
cedure is ordered, it thus becomes part of this shared 
knowledge. Consequently, we use definite articles to re- 
fer both to procedures and actions already introduced 
into the treatment plan by one of the system's messages 
and to entities introduced via the scribe nurse's entry 
of a procedure or action into the CBMR. For example, 
even though a peritoneal lavage does not appear in any of 
the system's earlier messages, a message about a related 
scheduling precondition will be realized as: 

Please remember to check for laparotomy scars before 
you do the peritoneal lavage. 

However, the system may disagree with the physician 
about whether a procedure is appropriate. Since the use 
of the definite article suggests an action's acceptance into 
the treatment plan, we use indefinite expressions when 
referring to procedures about which there is conflict. For 
example, if the physician has ordered a peritoneal lavage 
and the system believes that the need for it is dependent 
on the results of a chest x-ray, the system would generate 
the message 

Do not do a peritoneal lavage until after getting a 
chest x-ray since the outcome of the latter may affect 
the need to do the former. 

Evaluation 

While the final evaluation of TraumaGEN's effectiveness 
can only be measured by deploying it in a trauma bay 
and evaluating the degree to which its messages change 
physician behavior, preliminary evaluation can be used 
to determine its benefits and limitations and to identify 
where further work is needed. We ran TraumaGEN on 
48 collected cases of actual trauma care under a scenario 
in which critiques were produced after each physician 
order. 

We compared the critiques generated by TraumaTIQ 
alone with the messages produced when it was aug- 
mented with TraumaGEN. We found that TraumaGEN 
reduced the number of messages, resulted in more con- 
cise messages (measured in terms of the number of noun 
phrases in a message) , and required fewer shifts in focus 
to assimilate the messages. 

To evaluate coherence and quality of the messages, we 
asked a human subject not affiliated with our project to 
evaluate the new messages with respect to the original 
ones. The subject was given the messages produced by 
TraumaTIQ and TraumaGEN for a dozen cases. In ten 
of the twelve cases, the human subject preferred Trauma- 
GEN's messages; in eight of these cases, the preference 



was very strong while in the other two cases it was mod- 
erate. In the single instance in which the subject pre- 
ferred the original messages produced by TraumaTIQ, 
the preference was based on the English translation of 
two goals; the subject found the phrasing confusing when 
the messages were combined (since the two goals had 
very similar translations) but not confusing when the 
messages were separated. The subject's comments in- 
dicated that his preferences for TraumaGEN's messages 
were generally based on reduction of repetition, merging 
of related messages, and elimination of conflict. 

Conclusion 

This paper has presented our message planner, Trauma- 
GEN, that draws on rhetorical structure and discourse 
theory to produce integrated messages from individual 
critiques each of which is designed to achieve its own 
communicative goal. The need to construct coherent 
text from multiple individual text plans is a problem 
that will increasingly face natural language systems as 
sophisticated systems distribute their processing across 
individual modules each of which may need to commu- 
nicate with the user. TraumaGEN takes into account 
knowledge about the purpose of the messages, the situa- 
tion in which the messages will be received, and the social 
role of the system. Preliminary evaluation of Trauma- 
GEN indicates that it successfully constructs coherent 
integrated messages from individual critiques and that 
the resultant messages are comprehensible and a signif- 
icant improvement over the original critiques. 
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